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(57) Abstract: The sound processor and method uses a model of basilar membrane motion to select stimuli, based upon the pre- 
dicted motion which the acoustic signal presented would produce in an acoustically excited normally hearing cochlea. The filters 
used, in contrast to single channel per electrode approaches, cover multiple channels and overlap with each other. Consequently, 
the stimuli presented produce a neural excitation pattern which approximates the spatio-temporal travelling wave observed on the 
basilar membrane in an acoustically excited normally hearing cochlea. Preferably, the predicted electrode stimuli are based upon the 
instantaneous predicted amplitude of the electrode location. 
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SOUND PROCESSOR FOR A COCHLEAR IMPLANT 

Technical Field 

The present invention relates to cochlear implants, and to sound 
processing devices and methods relating to cochlear implants. 

5 

Background Art 

In normal hearing, sound causes mechanical vibrations that stimulate the 
hair cells of the cochlea to produce electrical impulses that travel down the 
auditory nerve where they are perceived by the brain as sound. If for some 

10 reason these hair cells are destroyed or not present within the cochlea, as is the 
case with individuals with severe or profound hearing loss, the nerve cells do not 
receive this electrical stimulation, therefore no sound is perceived. A cochlear 
implant attempts to replace this lost function by providing artificial electrical 
stimulation of the surviving auditory nerve. Cochlear implants have been in clinical 

15 use for many years. Such devices use an array of implanted electrodes to provide 
electrical stimuli to the cochlea. The electrical stimuli are determined by a 
processor responsive to speech and sound signals in the environment of the 
user. 

Historically, prior to around 1994, the majority of speech processors used 
20 in conjunction with a cochlear implant employed speech processing strategies 
that can be described as Feature Extraction Strategies. In such strategies, the 
associated implant hardware attempts to identify the speech features present in 
the detected sound signal and encodes such features as patterns of electrical 
stimulation. Feature extraction strategies have the advantage that the hardware 
25 required to perform the feature extraction is relatively simple and consumes a 
relatively low amount of power. 

With improvements in silicon chip technology and an increased knowledge 
of the safety of electrical stimulation, a new approach in sound processing 
became possible. This approach had the ability to provide a full range of spectral 
30 information of the speech signal without the need for the hardware to fit the signal 
into a preconceived mould, giving the patient the opportunity to listen to the 
particular information of interest, within background noise, providing a more 
realistic approach to speech processing. Such sound processors use band-pass 
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filters to separate acoustic signals into frequency bands or spectral components 
with relatively little overlap of the bands, with the electrodes being stimulated in a 
tonotopic fashion according to the energy in those bands. Usually they present a 
smoothed (low-pass-filtered) representation of the amplitude from each band to a 
5 single electrode. 

Despite considerable practical success with each of the existing schemes, 
the user perceptions of existing devices indicate that there are significant 
outstanding problems. Three fundamental problems of sound perception reported 
by cochlear implant users are poor frequency resolution and discrimination, poor 
10 perception of speech in noise at low signal-to-noise ratios, and poor perception of 
musical sounds. 

It is an object of the present invention to provide an alternative speech 
processor and processing method, in order to further improve the practical 
performance of the cochlear implant system. 

15 

Summary of the Invention 

In broad terms, the present invention provides a fundamental change to 
the traditional approach used in sound processing for cochlear implants. Instead 
of attempting to separate acoustic information into discrete frequency bands or 

20 channels, the inventive processor produces electrical stimulation patterns that 
excite broad overlapping regions of the cochlea. It is believed that the approach 
of the present invention will provide a better approximation to the behavior of the 
auditory structures during hearing by a normally hearing listener. Current 
processors produce localised stimuli based upon the frequency of components of 

25 the sound signal. In contrast, the present invention seeks to approximate the 
spatio-temporal neural excitation patterns which are induced by the motion of the 
basilar membrane as a response to sound stimuli in the normally hearing listener. 
The present invention seeks to produce a spatio-temporal pattern of stimulation 
along the length of an intra-cochlea electrode array, as opposed to merely 

30 localised stimuli. 

According to a first aspect, the present invention resides in a method of 
processing sound signals in order to generate electrical stimuli for an auditory 
prosthesis whereby a neural excitation pattern is produced which mimics the 



WO 01/99470 



3 



PCT/AU01/00723 



spatio-temporal pattern associated with the travelling wave observed on the 
basilar membrane in an acoustically excited normally-hearing cochlea. 

According to another aspect, the present invention provides a sound 
processor for use in a cochlear implant system, said sound processor being of the 
5 type which receives sound signals from a microphone or the like, processes said 
signals according to a predetermined instruction set, and provides stimulation 
instructions for an implanted electrode array, characterised in that the 
predetermined instruction set produces stimulus instructions which are intended 
to provide an approximation to the spatio-temporal waveforms induced in 
10 response to said sound signals on the basilar membrane of a normal hearing 
listener. 

According to yet another aspect, the present invention provides a method 
of processing sound signals so as to produce stimulus instructions for a cochlear 
implant, including the steps of 
15 deriving the vector of complex Fourier transform coefficients for a data 
sample; 

multiplying the vector of the coefficients by a complex matrix representing 
the amplitude and phase of the Fourier frequency components at the position of 
the electrodes in the cochlea relative to the amplitude and phase at the stapes in 

20 a normal cochlea to produce an output vector; and 

converting the output vector values to electrode current levels. 
Travelling wave aspects of basilar membrane response have been 
observed and reported on in investigations of normal auditory processes. 
However, there has been no previous attempt to utilise these phenomena as an 

25 element of stimulus processing for cochlear implants. The travelling wave may be 
thought of as a 3-Dimensional pattern in which the dimensions are time, distance 
along the basilar membrane, and displacement of the basilar membrane. The 
properties of these patterns that are thought to be important (and different from 
existing processor outputs) include a diagonal ridge structure of the 3D pattern, 

30 the dynamic nature of the ridge pattern that sweeps across the cochlear electrode 
array at a particular velocity that depends on position, the smoothly varying 
nature of the pattern in both space and time, and the maintenance of naturally- 
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occurring phase and amplitude relationships between the stimulation patterns on 
individual electrodes. 

Brief Description of Drawings 

5 The implementation of the present invention will be described in more 
detail with reference to the accompanying drawings, in which: 
Figure 1 is a graph illustrating a travelling wave excitation pattern produced in 
response to a pure tone of frequency 200 Hz; 

Figure 2 is a graph illustrating a travelling wave excitation pattern for tones at 200 
10 and 600 Hz; 

Figure 3 is a graph illustrating a travelling wave excitation pattern for tones at 470 
and 200 Hz; 

Figure 4 is a schematic illustration of one implementation of the invention; 
Figure 5 is a block diagram of one speech processing implementation of the 
15 present invention; 

Figure 6 is a graph of the typical frequency response of a 22-channel filterbank; 
Figure 7 is a graph depicting the typical delay of each of each of the 22 channels; 
Figure 8 is a block diagram showing the sub-components of the Electrical 
Encoding Component of Figure 5. 

20 

Detailed Description 

The present invention will be described with reference to the hardware 
implementation used by the applicant, using an implanted receiver/stimulator unit 
and an external speech processor and microphone. However, the present 

25 invention is of broad scope and can be implemented on any sufficiently 
sophisticated cochlear implant system. In particular, it is anticipated that the 
present invention will be able to be implemented more fully and in more detail on 
future generations of cochlear implants, with increased processing power and 
flexibility relative to the current state of the art. The present invention could also 

30 be implemented in a totally implanted device, or some intermediate stage 
between the present systems and a totally implanted device. 

The essential difference between the invention described and previous 
implant coding schemes is that the importance of overlapping information across 
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electrodes is recognized and a complex spatio-temporal pattern is produced. 
This pattern preserves, at least in part, the detailed amplitude and phase 
relationships between different positions that occur normally in an intact cochlea. 
These amplitude and phase relationships vary smoothly as a function of position 
5 along the cochlea to produce the acoustic "travelling wave" (von Bekesy, 1961). 
Instead of attempting to separate acoustic information into discrete frequency 
bands or channels, the travelling wave processor produces electrical stimulation 
patterns that excite broad overlapping regions of the cochlea. As an example, 
Figure 1 shows the excitation pattern that is produced by the inventive travelling 

10 wave processor in response to a pure tone input signal. Each line in the Figure 
shows the shape of the excitation function at an instant in time. The horizontal 
axis represents position along the cochlea from the basal (high frequency) end on 
the left to the apical (low frequency) end on the right. 

In contrast, the excitation pattern produced by the same pure tone input 

15 signal after processing by any of the currently used cochlear implant coding 
schemes would be localized to one position in the cochlea and represented by a 
narrow ridge running vertically up the page in a Figure analogous to Figure 1 . In 
some cochlear implant coding schemes the amplitude of the vertical ridge would 
be represented by electrical pulses at a fixed rate and level (ACE, SPEAK), or 

20 modulated at a rate equal to the frequency of the (low-frequency) input tone 
(F0F2, F0F1F2, MPEAK, CIS, SAS). In the latter cases, the excitation pattern is 
effectively a vertical slice taken from the pattern of Figure 1 at the point where the 
amplitude of the pattern is greatest. 

It is hypothesized that the auditory pathways of the brain have specialized 

25 perceptual mechanisms designed to recognize characteristics of 3-dimensional 
(position X time X amplitude) excitation patterns like those shown in Figure 1 . It is 
known that the visual and tactile sensory systems have neural mechanisms 
specialized to detect the orientation and spacing of stripes and ridges. The 
auditory system may be programmed in an analogous manner to recognise the 

30 diagonal ridges of patterns like the one in Figure 1 . If an incomplete pattern is 
presented to the cochlea (as in all prior art cochlear implant coding schemes) 
these perceptual mechanisms will be engaged but will not operate as effectively 
as they would if the whole pattern was presented. It is hypothesized that the 
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presentation of complete 3D patterns will improve the perception of sound by 
cochlear implant users and overcome or reduce some of the problems that are 
common to all prior art cochlear implant systems. Relative to normal hearing 
listeners, the problems for implant users include poor frequency discrimination 
5 and resolution abilities, poor speech recognition of speech in background noise, 
and lack of musical quality for processed musical sounds. 

When the stimulation is viewed as a 3D pattern, several consequences 
become more apparent: 

a) A whole pattern is easier to recognize than a partial pattern because 
10 it contains more information. This characteristic is also important for 

grouping sound components from the same source. Some of the temporal 
and spatial coherence of excitation patterns arising from a single sound 
source is lost when the signal is band-pass filtered into separate 
components that are encoded independently of one another. Conversely, 

15 sounds from different sources will be easier to separate if each one gives 

rise to a whole pattern, rather than a number of independent components 
which must be recombined by the perceptual mechanisms into an 
unknown number of sound sources. In the particular case of speech in 
background noise, it is important that the speech and the noise both 

20 produce complete 3D excitation patterns so that the perceptual 

mechanisms can use this information to allocate components of the 
combined pattern more easily to the noise or to the speech. 

b) Dynamic patterns are easier to recognise than stationary ones. It is 
well-known that the tactile system provides increased information about 

25 texture, shape and edges of objects if the fingers are moved over the 
surface of the object than if a static contact is made. In a similar way, the 
spacing of ridges in the 3D auditory excitation pattern may be enhanced 
perceptually by sweeping them along the cochlea. Similarly, onsets and 
offsets of sounds correspond to edges in the 3D pattern, and these may be 

30 perceived more clearly as they move along the cochlea, rather than just 
appearing with different amplitudes at different parts of the cochlea and 
then disappearing again. Thus the presentation of dynamic patterns may 
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improve frequency discrimination and resolution and perception of onsets 
and offsets of sounds with complex spectra. 

c) If a pattern is known to vary smoothly and regularly, missing 
sections can be interpolated or filled in. For example, one can "see" what 

5 is on the other side of a paling fence as one walks past even though most 

of the scene is obscured by the fence at any one time. This is because the 
visual system is able to reconstruct the continuous picture from the parts 
that are viewed at separate instants in time. In the case of auditory signals 
that are obscured by noise, parts of a smoothly varying, regular speech 

10 pattern may be perceived through temporal and spectral gaps in the noise 

and reconstructed in an analogous manner. However, if the speech and 
noise patterns do not vary smoothly with position, this reconstruction is 
much more difficult. This is a potential explanation for the fact that implant 
users are unable to recognize speech in noise when the signal-to-noise 

15 ratio is close to zero. The travelling wave processor may allow listeners to 
reconstruct lower amplitude speech signals even when they are partially 
obscured by more intense noise signals, provided that there are some 
temporal or spectral gaps in the noise signal. 

d) Tone complexes with harmonically related components produce 3D 
20 patterns with special characteristics in the regions where the tonal patterns 

overlap. These characteristics are not present in anharmonic complexes. 
They are also not present in the excitation patterns produced by existing 
cochlear implant sound processors because they do not produce 
overlapping patterns for individual tones separated by an octave or more. 
25 For example, Figure 2 shows the excitation pattern for 2 tones at 200 and 
600 Hz. In the overlap region, every third diagonal ridge is larger than the 
others because of the constructive addition of the two excitation patterns. 
Figure 3 shows an anharmonic combination where there is no regular 
summation of the two patterns in the overlap region. 
30 The travelling wave in normal hearing has been recognised and discussed 
in the scientific literature. However, this literature has had virtually no effect on the 
design of cochlear implants or hearing aids as far as the inventors are aware. 
One explanation for this is that the frequency response of the cochlea to 



WO 01/99470 



8 



PCT/AU01/00723 



sinusoidal signals is highly peaked and implant and hearing aid designers have 
chosen to ignore the low-frequency tails of the frequency response curves. The 
closest existing technologies are cochlear implant sound coding schemes that 
measure spectral characteristics of input signals with bandpass filters and 
5 represent them by stimulating individual electrodes. 

The preferred implementation of the present invention utilizes a digital- 
sign al-p roces so r to calculate an approximate travelling wave excitation pattern 
from a digitised input signal. The travelling wave pattern is essentially a 
specification of the displacement of each point on the basilar membrane of the 

1 0 cochlea as a function of time and position. The implementation is based directly 
on published experimental data from normally-hearing human subjects rather 
than theoretical models of basilar membrane mechanics. The implementation is 
also simplified to make it feasible for real-time implementation and to make it 
easier to parameterize the fitting procedure for individual cochlear implant users. 

15 Ope embodiment of the system according to the present invention is 

shown in figure 4, with the components of the system as listed below: 

1 . Microphone 1 1 to convert an acoustic input signal to an electrical 
signal, 

2. Preamplifier/Automatic Gain Control 12 to amplify and control the 
20 level of the electrical signal. 

3. Analog-to-Digital-Converter 13to convert the electrical signal to a 
stream of digital samples. 

4. Digital-Signal-Processor 14 to calculate the travelling wave pattern 
and convert it to an electrical stimulus pattern. 

25 5. Programmable Memory 15 to store patient-specific parameters, the 
processor programs, and intermediate results in calculating the travelling- 
wave pattern. 

6. Output Signal Generator 1 6 to control a cochlear implant and deliver 

the electrical stimulus to the implant patient. 
30 A simpler version of the present invention, in particular the digital signal 
processor, is shown in figure 5. In this particular aspect, the invention essentially 
resides in a system consisting of four main parts, a Basilar Membrane Motion 
Model which accepts an audio signal as input and calculates the displacement or 
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velocity of the basilar membrane at each electrode position; an Inner Hair Cell 
Model which calculates the amount of neural excitation at each electrode position 
based on the information received from the Basilar Membrane Motion Model; an 
Outer Hair Cell Model which provides a feedback path that takes into 
5 consideration the amount of neural excitation calculated by the Inner Hair Cell 
Model and the affects such excitation will have on the Basilar Membrane Motion 
Model; whereby the amount of neural excitation affects the response of the 
Basilar Membrane Motion Model; and an Electrical Encoding Component which 
calculates the pattern of electrical stimulation which will provide the desired neural 
10 excitation pattern. Each of these 4 components will be described in more detail 
below. 

Basilar Membrane Motion Model 

The Basilar Membrane Motion Model accepts an audio signal as input and 
15 calculates the displacement or velocity of the basilar membrane at each electrode 
position in relation to the audio signal. 

One possible embodiment of the Basilar Membrane Motion Model consists 
of the following steps, which are repeated continuously: 

1 The input audio signal is divided into short overlapping frames. Each 
20 frame contains L consecutive samples of the audio signal, and is defined 

as the column vector X1 . A suitable length is L = 128. Each frame heavily 
overlaps with the previous frame, and contains K new data points. A 
suitable value is K= 1. 

2 Multiply the input frame vector X1 point-by-point by a window vector 
25 W, resulting in a column vector X2 of length L, according to: 

X2(n) = X1 (n) * W(n) for n = 0 to L - 1 . 
A suitable window function is the Hann function, defined as: 

W(n) = 0.5 * (1 -cos(2 * n* pi/ L)) for n = 0 to L- 1. 

3 Calculate the L-point Fast Fourier Transform (FFT) of the column 
30 vector X2. This results in a column vector X3 of length L, with complex 

values. Because X2 is real, X3 has Hermitian symmetry, and the last L/2 
samples can be discarded (or not calculated). From the first L/2 samples, 
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only the real parts are required and the imaginary parts are discarded (or 
not calculated). The output is a real column vector X4 of length L/2. 
4 Multiply column vector X4 by a rectangular weights matrix G, 
according to: 

5 X5 = G*X4. 

The weights matrix G has N rows and U2 columns. The output is a column 
vector X5 of length N, where N is the number of channels. The weights 
matrix G determines the frequency magnitude response of each channel, 
and is further described below. 
1 0 5 Delay each channel by a time delay specified by column vector D, 

which has length N, according to the formula: 

X6(k, t) = X5(k, t - D(k)) for k = 1 to N 
Typical delays for a 22-channel processor are shown in Figure 7. The 
delay varies from zero delay at channel 1 (the most basal channel) to 8 
15 milliseconds at channel 22 (the most apical channel). 

The output is a column vector X6 of length N, where each element is a 
sample of one channel of the Basilar Membrane Motion Model. 

Each row of the matrix W represents the amplitude and phase of the FFT 
frequency components at the position of one of the electrodes in the cochlea 
20 relative to the amplitude and phase at the stapes (the input to the cochlea). The 
phase difference is equal to 2 pi times the time taken for the travelling wave to 
travel from the stapes to the position of the electrode multiplied by the frequency 
of the FFT component. The amplitude difference between the stapes and the 
electrode position is proportional to the response of the basilar membrane at the 
25 position of the electrode to a pure tone at the frequency of the FFT component (or 
alternatively, the tuning curve of a neuron at the position of the electrode). The 
amplitude coefficients at each electrode position have a peaked shape with the 
maximum at the FFT frequency closest to the characteristic frequency at the 
individual electrode position, and the amplitudes of FFT coefficients higher than 
30 this frequency fall rapidly to zero. 

An alternative way of implementing the delays in this system is by shifting 
the FFT window back in time by a different amount for each electrode. If the shift 
is chosen to be equal to the time taken for the travelling wave to travel from the 
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stapes to the electrode position, then the coefficients of the matrix W are all real 
(Ie the phase is zero for all FFT components). 

The weights matrix G can be calculated according to the following steps: 
1 . The characteristic frequency of each channel is determined based 
5 on the position of the electrodes in the cochlea, according to Greenwood's 
formula.A further correction to the should be applied to account for the fact 
that electrodes at a particular position in the cochlea actually stimulate 
neurons with a lower characteristic frequency than that predicted by 
Greenwood's formula (Blarney PJ, Dooley GJ, Parisi ES and Clark GM., 

1 0 Pitch comparisons of acoustically and electrically evoked auditory 

sensations. Hearing Research, 99, 139-150, 1996; James C, Blarney PJ, 
Shallop JK, Incerti PV & Nicholas AM. Contralateral masking in cochlear 
implant users with residual hearing in the non-implanted ear, audiology and 
Neuro-Otology, 6, 87-97, 2001). This correction factor implies that the 

1 5 effective distance of the electrode from the stapes is greater by a factor of 
2.625 / 1 .875. Alternatively, for subjects who have previously used another 
sound processor and have become accustomed to a particular frequency- 
to-electrode map, those frequencies can be used. The characteristic 
frequencies are stored in a vector C of length N. 

20 2. The centre frequency of each FFT bin is calculated and stored in 
vector B, of length U2. 

3. The weights matrix element G(k, b) represents the gain of channel k 
at the centre frequency of FFT bin b, and can be calculated according to 
the formula: 
25 ifB(b)<=C(k) 

G(k,b) = {B(b)/C(k)}AE 
else 

G(k, b) = 0 

where the symbol " A " means "to the power of" and the parameter E is 
30 called the gain exponent. Suitable values for E are in the range 1 to 5. 

Suitable choices for characteristic frequencies for 22 channels, with a gain 
exponent E = 1 result in the magnitude response shown in Figure 6. 
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The amplitude of each FFT component at each electrode position is 
represented by the magnitude of the corresponding element in matrix W. These 
amplitudes may be estimated from psychophysical tuning curves in humans with 
normal hearing (Zwicker, E. On a psychophysical equivalent of tuning curves. In 
5 Zwicker E. & Terhardt E (eds) Facts and models in hearing. pp132-141, Berlin: 
Springer-Verlag, 1974), from estimates of excitation in the loudness models of 
Zwicker ( Zwicker E. Masking and psychological excitation as consequences of 
the ear's frequency analysis, in Plomp R & Smoorenburg GF (Eds) Frequency 
analysis and periodicity detection in hearing, pp376-96, Leiden: AW Sijthoff, 
10 1970.) or Moore & Glasberg (Moore BCJ & Glasberg BR. A model of loudness 
perception applied to cochlear hearing loss. Auditory Neuroscience 3, 289-311, 
1997) or from an approximation or from an empirical function designed to 
optimise the travelling wave processor for individual implant users. 
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Inner Hair Cell Model 

The Inner Hair Cell Model calculates the amount of neural excitation at 
each electrode position based on the displacement or velocity of the basilar 
membrane at each electrode position in relation to the audio signal as calculated 
5 by the Basilar Membrane Motion Model as discussed above. A simple 
embodiment of the Inner Hair Cell Model is a half-wave rectifier, with other 
embodiments possible as would be obvious to those skilled in the art. The half 
wave rectification mimics the response of the hair cells in a normal cochlea. The 
amplitude of the half-wave rectified travelling wave at each electrode position is 

10 represented by the current level (or electric charge, or pulse width) of an electric 
pulse on that electrode. This mapping from amplitude to electrical stimulation 
parameters differs from conventional cochlear implant mapping in that the 
instantaneous amplitude of the travelling wave is represented rather than a 
smoothed amplitude or intensity which is averaged over a time window of several 

15 milliseconds. Conventional processors code the amplitude envelope rather than 
the instantaneous amplitude, and in doing so, they lose much of the temporal 
information carried by the signal itself. The coding of instantaneous amplitude is 
especially important to the travelling wave processor because coding envelope 
information would merely smear out the information from different frequency 

20 components rather than providing the detailed timing information illustrated in 
figures 1-3. In particular the 3-dimensional ridges would become much broader in 
both spatial and temporal dimensions, and would lose their dynamic 
characteristics. 

25 Outer Hair Cell Model 

The Outer Hair Cell model aims to emulate the non-linearity that is observed in 
the response of a person with normal hearing. This is performed by providing a 
feedback path to the Basilar Membrane Motion Model which takes into 
consideration the proposed neural excitation pattern and the affects such a 
30 pattern has on the response of the Basilar Membrane Motion Model. The output 
of the Inner Hair Cell Model is an estimate of the neural excitation pattern that 
would be present in a person with normal hearing. It has been found that the gain 
for low-amplitude audio signals is greater than the gain for large-amplitude audio 
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signals. This component is optional and may be omitted in a simplified 
implementation. 

Electrical Encoding Component 

5 The Electrical Encoding component calculates the pattern of electrical stimulation 
that will provide the desired neural excitation pattern. There are several possible 
embodiments of the Electrical Encoding component and some components that 
are used in the prior art of cochlear implant processors can be used to perform 
this function according to the present invention. It is important to note that it is the 

10 instantaneous amplitude of the waveform at each electrode position which is 
coded as the current level (or electric charge or pulse width) of an electric pulse 
on that electrode. This differs greatly from prior art systems where it is the time- 
averaged amplitude envelope of the waveform which constitutes what is coded as 
the current level of an electric pulse on that electrode. In essence, the conversion 

15 is effected by means of a function relating the amplitude to electric current level 
derived from prior measurements for each electrode which may be stored in the 
memory 15. 

The present invention can be used with implants that allow both 
simultaneous and/or non-simultaneous stimulation. If the invention is used on an 

20 implant that stimulates channels simultaneously, the travelling wave amplitudes at 
individual electrode positions can be represented by simultaneous electric 
currents (analog rather than pulsatile stimuli) on each individual electrodes. 

If the invention is used with an implant that stimulates channels 
sequentially (non-si multaneously), then the Electrical Encoding component can 

25 be divided into two sub-components as illustrated in Figure 8. The Sampler 
component samples the neural excitation pattern such that each output sample 
produces one electrical pulse. The Amplitude Mapping component calculates the 
electrical parameters of the pulse, such as current level and pulse width. 

One simple embodiment of the Sampler component is taken from the well- 

30 known Continuous Interleaved Sampling (CIS) processor. The neural excitation 
pattern is sampled in a round-robin fashion at a uniform rate on each channel, so 
the sampling rate is equal to the stimulation rate on each channel. The samples 
are interleaved across channels so that the electrical pulses are sequential (non- 
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overlapping). The rate must be sufficiently high so that the time waveform of the 
neural excitation on each channel is adequately represented. Typically this 
requires more than 1000 pulses per second on each channel. 

Note that in a standard CIS processor the filters are designed to be non- 
5 overlapping and relatively narrow, and the smoothed envelope of the filter outputs 
are sampled. In contrast, the present invention has broad, heavily overlapping 
filters and the instantaneous amplitude of the half-wave rectified filter output is 
sampled. 

The CIS Sampler embodiment has the disadvantage that high stimulation 
10 rates are required. An alternative embodiment, which is new in this invention, is 
called the Time Interval Maxima Sampler. It reduces the total stimulation rate that 
is required. It has the following steps: 

1. The neural excitation pattern is divided into short non-overlapping 
time intervals. Each time interval can be represented as a matrix X that has 

15 N columns, where N is the number of channels, and T rows, where T is the 

number of time samples of the neural excitation pattern in each time 
interval. The duration of the time interval is equal to the time taken to 
output a number M of electrical pulses, where M is less N. 

2. In each time interval the maximum value of each channel is 
20 calculated, i.e. the maximum of the matrix X across the rows. The output is 

a column vector Y with N columns, one for each channel. 

3. The amplitudes of the N samples in column vector Y are examined, 
and the M largest samples are retained. Each of these M samples 
produces one electrical pulse. The pulses are output in the next time 

25 interval. 

The Amplitude Mapping component can be the same as that used in the 
prior art Continuous Interleaved Sampling (CIS) processor or Spectral Maxima 
Sound Processor (SMSP). It has the following steps: 

1. The amplitude of each sample is compressed by a non-linear 
30 function known as a loudness growth function, which typically has a 

logarithmic shape. Each output P represents a proportion of the electrical 

dynamic range. 
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2. The current level L of each electrical pulse is calculated from the 
output P and the previously measured threshold T and maximum 
comfortable level C (for that channel) as: 

L = T + (C-T)*P 

5 Following this, the electrode(s) to be stimulated are selected, and the 
output signal generator 16 is fed the data required to produce the electrical 
stimulus pulses. 

It will be appreciated that there are various ways of implementing the 
present invention, for example using circuitry to provide the travelling wave type 
10 stimuli, which are included within the scope of the present inventive concept. 
Variations and additions are also possible within the general inventive concept 
disclosed. 
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THE CLAIMS: 

1. A method of processing sound signals for use in generating electrical 
stimuli for an auditory prosthesis, said method including the steps of processing a 
sound signal according to a predetermined instruction set to produce a set of 
stimulus instructions for an implanted electrode array, characterised in that the 
predetermined instruction set operates so as to produce a set of stimulus 
instructions which, when presented by said electrode array, produces a neural 
excitation pattern which approximates the spatio-temporal pattern associated with 
the travelling wave observed on the basilar membrane in an acoustically excited 
normally-hearing cochlea when stimulated with said sound signal. 

2. A method according to claim 1 wherein said predetermined instruction set 
calculates the travelling wave amplitude variations at each electrode position 
using broad overlapping filters which are not specific only to the frequencies 
associated with each electrode position, said filters having a shape which is 
based on one of either the amplitude envelope of the acoustic travelling wave in a 
normal cochlea, or the shape of the psychophysical tuning curves in a normal 
cochlea. 

3. A method according to claim 2, wherein said method includes the steps of 
sampling said sound signal to produce a data sample; 

deriving the vector of complex Fourier transform coefficients for said data 
sample; 

multiplying the vector of the coefficients by a complex matrix representing 
the amplitude and phase of the Fourier frequency components at the position of 
the electrodes in the cochlea relative to the amplitude and phase at the stapes in 
a normal cochlea to produce an output vector; and 

converting the output vector values to a set of stimulus instructions. 

4. A method according to claim 2 where the group delay of said filters is 
adjusted taking into account the time taken for an acoustic signal to propagate from 
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the stapes to each electrode position in an acoustically excited normally-hearing 
cochlea. 

5. A method according to claim 1 , wherein said predetermined instruction set 
includes a model of basilar membrane motion, said model including determining 
the phase and amplitude of said sound signal at each electrode position in an 
acoustically excited normally-hearing cochlea when stimulated with said sound 
signal. 

6. A method according to claim 5, wherein said predetermined instruction set 
includes an inner hair cell model, which calculates the level of neural excitation at 
each electrode position in an acoustically excited normally-hearing cochlea when 
stimulated with said sound signal. 

7. A method according to claim 6, wherein said model is a half-wave rectifier. 

8. A method according to claim 5, wherein said predetermined instruction set 
includes an outer hair cell model, such that the effect of the proposed set of 
stimulus instructions on the response of the basilar membrane is fed back to said 
basilar membrane model. 

9. A method according to claim 1 , wherein the set of stimulus instructions is 
based upon the instantaneous amplitude of the waveform at each electrode 
position. 

10. A method according to claim 9, wherein the set of stimulus instructions 
provides a stimulus on an electrode when the calculated travelling wave at that 
electrode position reaches a local maximum. 

11. A method according to claim 10, wherein if the calculated travelling wave 
at the position of more than one electrode reaches a local maximum value at the 
same time, the electrode with the larger amplitude value will be stimulated first. 
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12. A method according to claim 1, in which the predetermined instruction set 
further includes specific empirical modifications to improve the perception of 
speech or music in users. 

13. A sound processor for use in a cochlear implant system, said sound 
processor being of the type which receives sound signals, processes said signals 
according to a predetermined instruction set, and provides stimulation instructions 
for an implanted electrode array, characterised in that the predetermined 
instruction set operates so as to produce a set of stimulus instructions which, 
when presented by said electrode array, produces a neural excitation pattern 
which approximates the spacio-temporal pattern associated with the travelling 
wave observed on the basilar membrane in an acoustically excited normally- 
hearing cochlea when stimulated with said sound signal. 

14. A sound processor according to claim 13, wherein said predetermined 
instruction set calculates the travelling wave amplitude variations at each 
electrode position using broad overlapping filters which are not specific only to the 
frequencies associated with each electrode position, said filters having a shape 
which is based on one of either the amplitude envelope of the acoustic travelling 
wave in a normal cochlea, or the shape of the psychophysical tuning curves in a 
normal cochlea. 

15. A sound processor according to claim 14, wherein said predetermined 
instruction set samples said sound signal to produce a data sample; derives the 
vector of complex Fourier transform coefficients for said data sample; multiplies 
the vector of the coefficients by a complex matrix representing the amplitude and 
phase of the Fourier frequency components at the position of the electrodes in 
the cochlea relative to the amplitude and phase at the stapes in a normal cochlea 
to produce an output vector; and converts the output vector values to a set of 
stimulus instructions. 

1 6. A sound processor according to claim 9 wherein the group delay of said 
filters is adjusted taking into account the time taken for an acoustic signal to 
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propagate from the stapes to each electrode position in an acoustically excited normally- 
hearing cochlea. 

1 7. A sound processor according to claim 1 3, wherein said predetermined instruction 
set includes a model of basilar membrane motion, said model including determining the 
phase and amplitude of said sound signal at each electrode position in an acoustically 
excited normally-hearing cochlea when stimulated with said sound signal. 

18. A sound processor according to claim 1 7, wherein said predetermined instruction 
set includes an inner hair cell model which calculates the level of neural excitation at 
each electrode position Jn an acoustically excited normally-hearing cochlea when 
stimulated with said sound signal. 

19. A sound processor according to claim 18, wherein said model is a half-wave 
rectifier. 

20. A sound processor according to claim 1 7, wherein said predetermined instruction 
set includes an outer hair cell model, such that the effect of the proposed set of stimulus 
instructions on the response of the basilar membrane is fed back to said basilar 
membrane model. 

21 . A sound processor according to claim 1 3, wherein the set of stimulus instructions 
is based upon the instantaneous amplitude of the waveform at each electrode position. 

22. A sound processor according to claim 21 , wherein the set of stimulus instructions 
provides a stimulus on an electrode when the calculated travelling wave at that electrode 
position reaches a local maximum. 

23. A sound processor according to claim 1 3, in which the predetermined instruction 
set further includes specific empirical modifications to improve the perception of speech 
or music in users. 
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24. A method according to claim 13, wherein the set of stimulus instructions 
provides a stimulus on an electrode when the calculated travelling wave at that 
electrode position reaches a local maximum. 

25. A method according to claim 14, wherein if the calculated travelling wave 
at the position of more than one electrode reaches a local maximum value at the 
same time, the electrode with the larger amplitude value will be stimulated first. 
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